home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
Aminet 24
/
Aminet 24 (1998)(GTI - Schatztruhe)[!][Apr 1998].iso
/
Aminet
/
comm
/
tcp
/
tcpdl.lha
/
tcpdl.doc
< prev
next >
Wrap
Text File
|
1998-01-30
|
21KB
|
663 lines
tcpdl V2.3b distribution January 1998
CONTENTS
========
This archive consists of the following files:
tcpdl 2.3 the tcpdl http download program
tcpdlpp 1.4 the post processor for downloaded pages
tcpdl.doc this document
urllist example urllist file
tcpdl.config example tcpdl.config file
tcpdlpp.config example tcpdlpp.config file
INTRODUCTION
============
tcpdl is a program to download files from http hosts. Files to be
downloaded are listed in a file called "tcpdldir:urllist".
tcpdlpp is a program to post process files that have been downloaded
by tcpdl. This amends the URLs within each html file so that the
downloaded pages may be used locally, while any URLs that have not
been downloaded refer to their full URL.
COPYRIGHT
=========
tcpdl is Copyright © 1996 Patrik Nilsson
Copyright © 1997 Ramjam Consultants Ltd
tcpdlpp is Copyright © 1997 Ramjam Consultants Ltd
This archive is freely distributable, but may not be included in any
commercial software collection other than Aminet without prior permission.
REQUIREMENTS
============
To use tcpdl you need a TCP/IP stack and a connection to a http host.
Both tcpdl and tcpdlpp should work on any Amiga system which supports
a TCP stack.
In use tcpdl typically requires approx 1Mb of RAM plus 500-1000 bytes
for each file to be downloaded, and tcpdlpp requires about 100K plus
100 bytes per file to be processed.
Disk space equivalent to about twice the size of the files to be
downloaded is also required.
tcpdl tries to use Xen font for the status window. If this is not
available it will use the default font, but this may lead to some
data not being readable.
SUPPORT
=======
Please report any problems, and send any suggestions or comments to:
support@ramjam.demon.co.uk
The most recent version will always be available from
http://www.ramjam.demon.co.uk/software
Major versions may also be available from Aminet.
There is no requirement to register use of this program, but anyone
doing so will be kept informed of updates.
Both programs have been optimised for the 68000 processor in order to
be as generic as possible.
KNOWN PROBLEMS
==============
tcpdl:
- doesn't handle URLs which use Host Masking.
- doesn't allow access to sites which require a userid/password.
- doesn't support the automatic download of URLs referenced by the
name attribute of the object tag. This is because the name attribute
does not always apply to a URL. Since the object tag is only supported
by IE3, and the name attribute refers to a URL that is likely to be
applicable only to IE3, this is not seen as a significant restriction.
- doesn't handle URLs containing invalid Amiga filenames as well as
it could
tcpdlpp:
- No current reported problems.
DISCLAIMER
==========
A great deal of effort has gone into making these programs as reliable
as possible. However, there is no guarantee that they will perform as
described in all cases.
These programs are used entirely at the user's own risk. No liability
can be accepted for loss of data resulting from the use of these
programs.
AKNOWLEDGEMENTS
===============
Thanks to Patrik Nilsson for the original version of tcpdl.
Thanks also to all the users of tcpdl for their comments and suggestions.
CHANGES FROM PREVIOUS RELEASE
=============================
tcpdl
-----
V2.3a -> V2.3b:
- tcpdl sent http commands that some hosts didn't like - it should now
manage to talk to hosts using any version of HTTP.
- The UPDATE flag wasn't correctly propogated to all the URLs referenced
by the files marked with UPDATE in the urllist.
This was a fundamental design flaw, and so the UPDATE flag has been
removed and replaced by an UPDATE command line option which applies
to all URLs in urllist.
- The IMG flag didn't quite do what was advertised, so it has been
amended slightly:
TEXT means all URLs that are referenced by a "href" attribute
IMG means all URLs that are referenced by an attribute other than "href"
ALL means all URLs that are referenced
- added a configuration option USER, which allows the mail address to be
used in the HTTP commands to be specified. By default tcpdl uses the
username and host name used by the tcp/ip stack. If the USER option
is specified with no mail address, then no From: header will be
sent in HTTP commands.
- tcpdl used to send the user's realname to the http host - there is no
need to do this, so now it doesn't.
- fixed the DISK-ERR message, which looked untidy.
V2.3 -> V2.3a:
- made it possible to exit tcpdl when run from Workbench
- fixed enforcer hits when urllist contains a blank line
V2.2 -> V2.3:
- added support for proxy servers
- added configuration options for connection, and htpp transfer timeouts
- added configuration option for the number of retries for failed
transfers
- tcpdl now uses memory pools for its memory allocation. This improves
the allocation times slightly, but has a dramatic effect on the
deallocation time. It also reduces the risk of memory "leakage" when
tcpdl is interrupted.
- the User Agent name has been changed to conform to RFC2068
- the tcpdl.config file is now closed as soon as it has been read
- the environment variable USERNAME is now used if USER is not set
- changed the ERROR status indicator to be more specific about the
cause of the error
V2.1 -> V2.2:
- added tcpdl.config to allow the specification of file types that are
not to be downloaded.
V2.0 -> V2.1:
- fixed problem where URLs in the urllist file which were not followed
by at least one space or tab character were ignored.
tcpdlpp
-------
V1.3 -> V1.4:
- changed the handling of URLs with a leading '/'. All URLs are now
converted to relative form, so all local links should work when
browsing off-line.
V1.2 -> V1.3:
- minor optimizations
V1.1 -> V1.2:
- improved the handling of ".." in URLs
- added optional translation of characters in URLs, by means of a new
configuration file "tcpdldir:tcpdlpp.config"
USING TCPDL AND TCPDLPP
======================
OVERVIEW:
---------
Both tcpdl and tcpdlpp expect the assign tcpdldir: to refer to a directory.
This directory is the work area for both programs.
The urllist file, containing the list of URLs to be downloaded is expected
to be in this directory. The optional configuration files, tcpdl.config
and tcpdlpp.config should also be in this directory.
When tcpdl downloads URLs it will create two directories below tcpdldir:,
DATA and HTTP. Beneath each of these, one directory will be created for
each host, and beneath each of these will be the directories and files which
are downloaded.
The HTTP directory contains the actual files that are downloaded, while the
DATA directory contains files holding information about each file downloaded.
Example:
the following directory tree shows the structure that might result from use
of the example URLs given in the "THE TCPDLDIR:URLLIST FILE" section below.
tcpdldir:
|
|
+------- urllist
|
|
+------- HTTP
| |
| |
| +------- www.ramjam.demon.co.uk
| | |
| | |
| | +------- index.html
| | |
| | |
| | +------- amiga
| | | |
| | | |
| | ... ...
| |
| |
| +------- www.cu-amiga.co.uk
| | |
| | |
| | +------- index.html
| | |
| | |
| ... ...
|
|
+------- DATA